ETCH ENDPOINT DETECTION 



Field of the Invention 

This invention relates to control of etch processes in integrated circuit fabrication and, 
5 more particularly, to etch endpoint detection. 

Background 

Fabrication of integrated circuits on semiconductor wafers involves repetitive 
application of processes, such as masking, etching, layer formation, and doping. This invention 
10 primarily concerns the etching process, that is, the process of removing selected portions of 
layers of material during integrated circuit fabrication. 

As the densities and complexities of semiconductor devices have increased, the sizes of 
the various features have decreased, and the progress toward smaller feature sizes has given rise 
3 to a need for improved control over the etching process. It can be particularly desirable, for 

15 example, that the edge profile be straight and vertical and sharply defined. And, for example, it 
Cri can be particularly desirable that the etching process be controllable so that etching can be 

i]i stopped after the portion of the etched layer has been fiiUy etched through, but before an 

^^"^ unacceptable amount of the underlying material has been etched away. 

C 3 Any of various anisotropic etching techniques can provide a sharply defined steep (high 

J'"; 20 aspect ratio) edge profile. Suitable anisotropic techniques include dry etching by ion sputtering, 
'^t and ion assist etching such as, for example, the Lam Research TCP 9400 polysilicon etcher, 

p Some etch techniques are selective to some degree, that is, the techniques remove some 

materials more rapidly than other materials, and an etch technique may be selected that is 
selective for the etched material as compared with the underlying material. Typically, 
25 anisotropic techniques providing an acceptable edge profile are not highly selective for the 

etched material as compared with the material underlying the etched layer, and permitting the 
process to continue etching after a portion of the underlying layer has been results in removal of 
part of the underlying material, which may produce a defect. 

The length of time required to complete an etch of a particular material using a 
30 particular etch technique can be approximated on the basis of experience or by trial. Owing to 
variations in processing, however, different wafers in a process run can in practice have 
significantly different etch rates. An etching strategy that simply allows an etch to proceed for a 
predetermined length of time poses a risk that on some wafers etch will be incomplete or that 
underlying material will be etched. Accordingly, in state-of-the art processing, some provision 
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is typically made for monitoring the etch process to determine when the etch has proceeded to 
the point at which some portion of the underlying layer is exposed. One approach to detecting 
when the underlayer has been exposed is to monitor the plasma emission product; a change in 
the plasma emission product signals exposure of the underlayer, and at that point the process is 
switched to a more highly selective etch technique, often termed an "over etch". 

One approach to etch endpoint detection, useful in processes for etching transparent 
materials such as siUcon dioxide (Si02), or partially transparent materials such as polysiUcon, 
employs interferometry. In this approach a beam of photons at a wavelength to which the 
material to be etched is at least partially transparent, is directed at the layer that is to be etched, 
and a reflected portion of the beam is detected by a suitable photodetector. Some of the beam is 
reflected by the front (upper) surface of the layer to be etched, and some is reflected by the back 
(lower) surface of the layer to be etched. These reflected components interfere optically 
according to the relation 

2d-N(>./n), 

where d is the thickness of the layer along the path of the beam, X is the wavelength of the 
photon beam, and n is the refractive index of the layer material. As will be appreciated, the 
front and rear surfaces of the layer to be etched may be described as interfaces at the upper limit 
of the layer material and between the layer material and the underlying layer. Where there is no 
phase reversal at the interfaces, for integral values of N, that is N = 1, 2, 3 . . ., the reflected 
components interfere constructively, producing intensity maxima; and for half-integral values 
of N, that is, where N = I/2, 3/2, 5/2 . the reflected components interfere destructively, 
producing intensity minima. Where the different refractive indices at the interface result in a 
phase reversal at one of the surfaces, as is the case where the layer to be etched is a polysilicon 
(refractive index about 4) and the underlying layer is a silicon dioxide (refractive index about 
1.5), for example, intensity minima are produced at integral values of N and intensity maxima 
are produced at half-integral values of N. 

During the course of removal of material by the etch, as the thickness d of the layer 
decreases, the intensity of the reflected portion of the beam cycles sinusoidally over time. The 
distance between adjacent intensity maxima is YiiX/n), so that monitoring features of the 
intensity curve during etch can provide measures of the thickness of material removed from the 
etched layer and of the etch rate. A change from the sinusoidal time-course of intensity occurs 
more or less abruptly at the moment the thickness of the layer reaches 0, that is, at the moment 
the etch is complete. One conventional way to attempt to detect etch endpoint is to try to 
recognize, as precisely as practicable, an abrupt change in the shape of the intensity curve. 
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It is possible, at least in principle, to employ interferometry to calculate an etch depth, 
by counting interference fringes at a particular wavelength. In practice, however, the material 
being etched may not be sufficiently transparent at useful wavelengths. Polysilicon, for 
example, absorbs more at shorter wavelengths, so that at interference fiinges from shorter 
wavelengths may not be detectable over the fiill thickness of the layer. Polysilicon is more 
transparent to longer wavelengths, but the resulting interference fringes are farther apart, so that 
etch depth cannot be accurately determined from them. Moreover, the initial thickness of the 
layer to be etched can vary significantly from one wafer to another within a run, and, because 
counting fiinges provides only a measure of etch depth, the thickness of material remaining 
varies significantly as well. 

As the dimensions of features in integrated circuits have decreased, the thicknesses of 
certain of the layers have also become exceedingly small. The dimensions must fall within 
increasingly smaller tolerances, so that even a small degree of etch in the material underlying 
an etched layer, which may result from permitting the etch process to continue even a very 
short time beyond the endpoint, can result in defective performance. A need exists, therefore, 
for increasingly precise etch endpoint detection. 
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Summary 

In one general aspect the invention features a method for determining an endpoint for 
etching a layer by steps of estimating the etch endpoint and, during etch, directing radiant 
energy at two or more wavelengths onto the layer to be etched, detecting the last intensity 
maximum reflected at a first wavelength prior to the estimated etch endpoint, and detecting the 
intensity maximum reflected at a second wavelength first occurring after the last intensity 
maximum at the first wavelength. The layer is at least partly transparent to the longer 
wavelength at the full thickness of the layer, and at least partly transparent to the shorter 
wavelength at layer thicknesses toward the endpoint. That is, the material making up the layer 
is sufficiently transparent to the first wavelength that at least a portion of a component of the 
first wavelength reflected fi*om the back surface is detectable through the full initial thickness 
of the layer, and the material making up the layer is sufficiently transparent to the second 
wavelength that at least a portion of the component of the second wavelength reflected fi-om the 
back surface is detectable toward the end of the etch, so that the second wavelength maximum 
can be detected. The first wavelength is longer than the second wavelength. The endpoint is at 
the point of intensity maximum of the second wavelength or is at a point following an interval 
thereafter. 

In another general aspect the invention featiures a method for determining an endpoint 
for etching a layer having an approximate initial thickness by steps of, during etch, directing 
radiant energy at three or more wavelengths onto the layer to be etched; selecting first, second, 
and third wavelengths; approximating an etch rate from the time interval between a first 
detected intensity minimum and an adjacent intensity maximum reflected at the third 
wavelength, estimating an etch endpoint from the approximate initial thickness of the layer and 
the approximate etch rate; detecting the last intensity maximum reflected at the first wavelength 
prior to the estimated etch endpoint; and detecting the intensity maximum reflected at the 
second wavelength first occurring after the last intensity maximum at the first wavelength. The 
material making up the layer is at least partly transparent to both the first and the second 
wavelength. The first wavelength is longer than both the second wavelength and the third 
wavelength. In some embodiments the third wavelength is longer than the second wavelength. 
The endpoint is at the point of intensity maximum of the second wavelength or is at a point 
following an interval thereafter. 

In some embodiments the etching process forms gates, and the layer xmderlying the 
etched layer is a gate oxide layer. In some embodiments the layer to be etched is a polysilicon, 
which may be doped or not, or is some other material such as an amorphous silicon, for 
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example; and the underlying layer is a dielectric material such as, for example, a silicon 
dioxide. The layer to be etched may further include in some applications, a silicide over the 
polysilicon. 

In another general aspect the invention features a method for etching a layer, by 
performing a main etch using an anisotropic etch method and employing the steps according to 
the invention as described above for determining etch endpoint; stopping the main etch at the 
etch endpoint, and then completing the etch using a more selective finish etch technique. 

The method can be used to particular advantage where the layer underlying the etched 
layer is very thin, and where the thickness of the underlying layer is a critical parameter for 
performance of the device. These circumstances pertain, for example, where the etch is used to 
form gates, and where the underlying layer is very thin, as, for example, a gate oxide layer 
having a thickness in the range firom as thin as about 15 A to as thick as about 40 A. 

In another general aspect the invention features apparatus for carrying out the etch 
endpoint determination and etch method of the invention as described above, including a 
process chamber, a source of light at two or more wavelengths, a light detector sensitive at two 
or more of the source wavelengths, means for directing the light from the source onto the layer 
to be etched in a wafer in the chamber and for conducting reflected light to the detector, and 
signal processing means adapted to determine intensity maxima over the course of the etch at 
two or more wavelengths in the reflected light. 
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Brief Description of the Drawings 

Figs. lA, IB, IC and ID are diagrammatic sketches in sectional view thru a portion of a 
partially processed semiconductor wafer, showing steps in an etch process. 

Fig. 2 is a diagrammatic sketch of apparatus for carrying out the method of the 
invention. 

Fig. 3 is a plot showing interferometric traces made during the course of a polysilicon 

etch. 

Fig. 4 is a plot showing interferometric traces made during the course of a polysilicon 
etch, illustrating one embodiment of the method according to the invention. 

Fig. 5 is a plot showing interferometric traces made during the course of a polysilicon 
etch, illustrating an alternative embodiment of the method according to the invention. 
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Detailed Description 

The invention will now be described in detail, with reference to drawings illustrating 
examples of representative embodiments of the invention. 

As will be appreciated, the drawings are diagrammatic and not to scale; in particular, 
some dimensions are exaggerated for clarity of presentation. 

An etch process is illustrated in Figs. lA, IB, IC and ID, showing diagrammatically by 
way of example a mask etch of portions of a polysilicon layer over a gate oxide layer to form 
polysilicon gates. Referring now to Fig. lA, there is shown a sectional view of a portion of a 
partially processed semiconductor wafer. A gate oxide layer 14 of silicon dioxide is formed on 
a semiconductor substrate 12, and a polysilicon layer 16 is formed over the gate oxide layer 14. 
Photoresist 18 is patterned upon the polysilicon layer according to the desired arrangement of 
gates. 

Typically part of the polysilicon layer at and near the upper surface becomes oxidized 
during processing to this point, and an aggressive "breakthrough etch" may be employed to 
remove the surface polysilicon oxide. Suitable etch techniques for a polysilicon oxide 
breakthrough etch include, for example, use of fluorine-containing species such as, e.g,, CF4 or 
C2F6. Fig. IB shows the appearance of a partially processed semiconductor wafer as in Fig. 1 A, 
after breakthrough etch. 

Following a breakthrough etch, where such is employed, a "main etch" is used to 
remove the bulk of the polysilicon material layer. Fig, IC shows the appearance of a partially 
processed semiconductor wafer as in Fig. 1 A, following main etch. The main etch is performed 
using an anisotropic etch technique, to provide sharply defined edge profiles having a high 
aspect ratio. Suitable etch techniques for a main etch in polysilicon include, for example, use of 
chlorine gas and HBr at a low pressure such as, e,g., 10-20 millitorr. Such anisotropic etch 
techniques are typically poorly selective for polysilicon over silicon dioxide and, to prevent 
etching of the underlying gate oxide layer, the main etch has been terminated at a point at 
which a small amount of polysilicon 17 remains to be removed. That is to say, the main etch is 
stopped before all of the polysilicon has been removed. 

The etch is completed by use of a more highly selective etch technique to remove the 
remaining polysilicon without etching the underlying gate oxide to an unacceptable extent. 
Suitable etch techniques for completing an etch in polysilicon over a silicon dioxide include, for 
example use of oxygen gas plus HBr at higher pressure such as, e.g., at least about 30 millitorr 
and as high as about 80 millitorr. Such selective etch techniques provide a relatively poor edge 
profile, however. The result appears in Fig. ID, showing a suitably high aspect ratio with a 
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well-defined edge profile on much of the resulting gates 15, with a somewhat poorer edge 
profile 13 nearer the gate oxide layer 14. 

A similar process may be employed to etch any of a variety of layer materials to any of 
a variety of underlying material such as silicon oxide or other dielectric material. Because the 
5 more selective finish etch yields a poorer edge profile, the challenge, in general, is to employ 
the anisotropic main etch as far down toward the underlying layer as possible without etching 
into it, in order to minimize the extent of material that must be removed by the finish etch. 

According to the invention, an etch endpoint is determined by estimating the etch 
endpoint, and during etch directing radiant energy at two or more wavelengths onto the layer to 
10 be etched, detecting the last intensity maximum reflected at a first wavelength prior to the 

estimated etch endpoint, and detecting the intensity maximimi reflected at a second wavelength 
first occurring after the last intensity maximum at the first wavelength. 

Apparatus for carrying out the invention is shown by way of example in Fig. 2, 
%i generally at 20. A processing chamber 22 is provided with means for holding the wafers during 

f a 15 processing, means for transferring wafers, and means for exposing the wafers to the etch 
process. These parts of the apparatus can be conventional, and will in practice be selected 
I J according to the user's specifications and according to, for example, the chosen etch technique. 

^" A fiberoptic bundle 24 is configured and arranged to direct light onto a wafer being 

processed in the chamber and to receive light reflected fi*om the wafer. A portion 23 of the 

I n 

■a • 3 

[ij 20 fibers in the fiberoptic bundle 24 are coupled to a broad band light source 26, so that they 

1 J. 

conduct light supplied by the source through the bimdle and onto the wafer. Another portion 25 
O of the fibers in the fiberoptic bundle 24 are coupled to a detector 28, so that they conduct light 

reflected fi-om the wafer through the bundle to the detector, where they reflected light is 
detected. A suitable fiberoptic bimdle can be obtained from, for example. Oriel Instruments. 

25 The light source 26 provides light in at least two wavelengths to which the material of the layer 
to be etched is transparent or partly transparent (semitransparent); it may be preferable to use a 
broad band light source such as a xenon lamp or a halogen lamp for the light source 26. 
Suitable light sources can be obtained commercially fi-om, for example. General Electric. The 
detector 28 is capable of detecting at least two wavelengths reflected fi-om the wafer; it may be 

30 preferable to use a broad spectrum photodetector such as a CCD array or a diode array with a 
number of photon sensitive elements over a range of wavelengths. A suitable CCD array can be 
obtained commercially fi-om, for example. Ocean Optics Corporation. A suitable diode array 
having more than 516 photon sensitive elements, and capable of acquiring fiill spectra ranging 
fi-om 200 nm to 950 nm continuously, can be obtained commercially from, for example. Oriel 
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Instruments. A suitably programmed computer 29 can receive and analyze data from the 
detector 28 and can control the process chamber 22. The computer 29 is provided with 
peripheral devices, including output devices such as display and printer and input devices for 
operator input. 

5 As will be appreciated, any of a variety of means, knovm to the skilled artisan, may be 

employed for directing light in two or more wavelengths from a source onto the layer to be 
etched on a wafer in a process chamber, and for collecting and detecting light in two or more 
wavelengths reflected from the layer. 

The wafer bearing the layer to be etched is positioned in the chamber, and the etch 
10 process is started. As the etch proceeds, light from the source is directed onto the layer being 
etched, and reflected light is detected by the detector. By way of illustration. Fig. 3 shows time 
traces at wavelengths of 450 nm (31), 500 nm (32), 600 nm (33), 700 nm (34) and 800 nm (35) 
f2 during etch of a polysilicon layer that was initially approximately 220 nm thick. Etch was 
initiated at about time 0, and etch was complete after about 50 seconds in this example. As 

CO 15 predicted by the optical interference relation 2d = N(X,/n), and looking particularly at the traces 

ffi 

In for 800 nm, 700 nm and 600 imi, the traces show a generally sinusoidal relation of intensity 

J^" over time, with a greater time interval between interference intensity maxima for longer 

3 wavelengths. The polysilicon used in this example is at least to some degree transparent to all 

the wavelengths shown in Fig. 3 but, because the extinction coefficient is greater for light at 

N 20 shorter wavelengths, oscillations appear in the traces for the shorter wavelengths 500 nm and 

i 5 

□ 450 nm only after some etch time has passed and the polysilicon has thinned significantly. As 

the thickness of the polysilicon layer approaches zero (just before about 50 seconds in this 
example), the traces for all the wavelengths approach intensity minima. 

In the example illustrated in Fig. 3, two interference intensity maxima appear in the 

25 trace at 800 nm, while three maxima appear in the trace at 600 nm. The second maximum in the 
trace at 800 nm occurred about three-quarters of the way into the etch time; that is, the last 
maximum at 800 nm prior to the etch endpoint occurred at a point in the process at which about 
one-foiuth of the thickness of the polysilicon layer still remained to be etched away. Stopping 
the main etch at this point and completing the etch using a selective finish etch technique would 

30 result in an unsatisfactory edge profile. After the second maximum at 800 nm, the next 

detectable maximum at 500 nm, for example, occurred about nine-tenths of the way into the 
etch time; that is, the next maximum at 500 nm following the last maximum at 800 nm occurred 
at a point in the process at which only about one tenth of the thickness of the polysilicon layer 
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remained to be etched away. Stopping the main etch at this point and completing the etch using 
a finishing etch would result in a satisfactory edge profile. 

One embodiment of the method according to the invention is further understood with 
reference to Fig. 4, showing traces generally as in Fig. 3, for 800 nm (45) and 500 nm (42) 
only. First, the etch time is estimated. This may be done either by performing a trial etch on a 
test wafer and inspecting the data as described with reference to Fig. 3, or by derivation or 
calculation based on known or derived parameters such as approximate layer thickness and 
approximate etch rate. Then a longer wavelength is selected. A preferred longer wavelength is 
one to which the material of the layer being etched is relatively highly transparent, and 
preferably is one that is expected to display at least two distinguishable interference intensity 
maxima over the estimated etch time. Then a shorter wavelength is selected. A preferred shorter 
wavelength is one to which the material of the layer is transparent enough to show a 
distinguishable interference intensity maximum at layer thicknesses that would be encountered 
following the last maximum at the longer wavelength. For the material treated in this example, 
a suitable longer wavelength is 800 nm and a suitable shorter wavelength is 500 nm, as 
discussed above with reference to Fig. 3. The shorter wavelength is chosen according to the 
process being used as well as according to properties of the layer being etched. Generally, the 
shorter the second wavelength is, the closer the second wavelength maximxmi will be to the 
point where the underlying layer becomes exposed, that is, the later the endpoint will be. 
Accordingly, where condition of the etch process or of the material being etched are such that 
an earlier endpoint is favored, then a longer second wavelength is desirable. An earlier etch 
may be desired, for example, if the etch front is not flat, as where minitrenches form around the 
edge of the etch; or if composition of the material is not imiform, so that some of the material 
etches more rapidly; or if the film is not smooth. Then the etch is initiated and the illumination 
and detection apparatus is turned on. The next step is to detect the last maximimi at the longer 
wavelength. Where, as in this example, there is some certainty that there can be only two 
maxima prior to endpoint at the longer wavelength, the second maximum is the last maximum 
prior to endpoint. On the other hand, if for example 600 nm were selected as the longer 
wavelength, there is some certainty that there can be only three maxima, and the third 
maximum is the last one prior to endpoint. 

Altematively, to find the last of two or more maxima, a delay time (e.^., 41 in Fig. 4) 
can be defined as an approximate time period at which a minimum (e.g,^ 43 in Fig. 4) before the 
last maximimi (e.^., 44 in Fig. 4) is expected to occur, and, accordingly, the last maximum will 
be the maximum detected following the end of the delay time. The greater the time period 
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between maxima, the greater the tolerance for error around the estimated delay time, and the 
longer the wavelength, the greater the time period between maxima. In this example, 
significantly greater margin of error in the delay time is permitted at 800 nm than at 600 nm, for 
example, and for that reason for this material in a layer of this thickness using the etch 
5 technique employed in this example 800 nm may be a preferred longer wavelength. Once the 
last maximimi is detected, the next step is to detect the next maximum (46 in Fig. 4) at the 
shorter wavelength. The endpoint is at the point where the maximum at the shorter wavelength 
is detected, or is at a point thereafter. The thickness of the layer remaining to be etched at the 
point of the maximum at the shorter wavelength depends upon what the shorter wavelength is, 
10 as discussed above with reference to Fig. 3. An interval following detection of the shorter 
wavelength maximimi may be programmed or may be selected by the operator. 

Another embodiment of the method according to the invention is further understood 
p with reference to Fig. 5. In this embodiment, no operator input of delay time is necessary, as 
}i interference intensity at a third wavelength is employed to estimate delay time automatically. 

m 

CO 15 Referring to Fig. 5, traces for a longer wavelength 800 nm (55 in Fig. 5) and for a shorter 

en 

|r| wavelength 500 nm (52 in Fig. 5) are displayed as in Fig. 4. A third wavelength (53 in Fig. 5), 

Jk! here 600 rmi for illustration, is employed to calculate an approximate etch rate according to the 

s relation 

Etch rate = (1/1 6) X, / 5t, 

^ - 20 where 6t is the time interval between an intensity minimum and an adjacent intensity maximum 

C3 at the third wavelength. As shovm by way of example in Fig. 5, the first detectable minimum 

C3 

and maximum at the third wavelength can be employed in this calculation. The delay time is 
calculated fi*om the approximate initial layer thickness and the calculated approximate etch rate. 
The third wavelength is shorter than the longer wavelength, yet long enough to provide a 

25 detectable minimum and maximum early in the etch cycle, as shown for example in Fig. 5. 

Referring again to Fig. 3, this material is not sufficiently transparent to wavelengths of 500 nm 
or 450 nm to provide any detectable minima or maxima early in the etch process at these 
wavelengths. Although earlier minima and maxima are detectable in the trace at 700 nm, this 
wavelength may be less preferred than 600 nm for this material at this initial thickness because 

30 the first well-discernible maximum is just prior to the desired delay period endpoint. 

In still another embodiment, the initial thickness of the layer to be etched may be 
determined using broad band interferometry by way of the same apparatus as is used during 
etch. The thickness data may be stored and used without operator input. In this embodiment the 
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operator may input a desired thickness for remaining material at endpoint, and the remainder of 
the process may by suitable programming be entirely automatic. 

As will be appreciated, the traces shown in the examples are intended to illustrate the 
principle of the method according to the invention, and the data need not be displayed. 
Moreover, as will be appreciated, only selected portions of the data over the time course of the 
etch process need be employed. It is to be understood that where the method calls for 
determination of the point at which a maximum or a minimum occurs, any of a variety of data 
analysis methods may be employed, which may use any of a variety of mathematical methods, 
as will be well within the purview of the person of ordinary skill. 

Other embodiments are within the following claims. 
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